Attentive deep neural networks for legal document retrieval

نویسندگان

چکیده

Legal text retrieval serves as a key component in wide range of legal processing tasks such question answering, case entailment, and statute law retrieval. The performance depends, to large extent, on the representation text, both query documents. Based good representations, model can effectively match its relevant Because documents often contain long articles only some parts are queries, it is quite challenge for existing models represent In this paper, we study use attentive neural network-based document We propose general approach using deep networks with attention mechanisms. it, develop two hierarchical architectures sparse sentences articles, name them Attentive CNN Paraformer. methods evaluated datasets different sizes characteristics English, Japanese, Vietnamese. Experimental results show that: (i) substantially outperform non-neural terms across languages; (ii) Pretrained transformer-based achieve better accuracy small at cost high computational complexity while lighter weight achieves datasets; (iii) Our proposed Paraformer outperforms state-of-the-art COLIEE dataset, achieving highest recall F2 scores top-N task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Surgical video retrieval using Deep Neural Networks

Although the amount of raw surgical videos, namely videos captured during surgical interventions, is growing fast, automatic retrieval and search remains a challenge. This is mainly due to the nature of the content, i.e. visually non-consistent tissue, diversity of internal organs, abrupt viewpoint changes and illumination variation. We propose a framework for retrieving surgical videos and a p...

متن کامل

Convolutional Deep Neural Networks for Document-Based Question Answering

Document-based Question Answering aims to compute the similarity or relevance between two texts: question and answer. It is a typical and core task and considered as a touchstone of natural language understanding. In this article, we present a convolutional neural network based architecture to learn feature representations of each questionanswer pair and compute its match score. By taking the i...

متن کامل

Deep Neural Networks for Czech Multi-label Document Classification

This paper is focused on automatic multi-label document classification of Czech text documents. The current approaches usually use some preprocessing which can have negative impact (loss of information, additional implementation work, etc). Therefore, we would like to omit it and use deep neural networks that learn from simple features. This choice was motivated by their successful usage in man...

متن کامل

Text Document Retrieval by Feed-forward Neural Networks

The paper deals with text document retrieval from the given document collection by using neural networks, namely cascade neural network, linear and nonlinear Hebbian neural networks and linear autoassociative neural network. With using neural networks it is possible to reduce the dimension of the document search space with preserving the highest retrieval accuracy.

متن کامل

Document Binarization Combining with Graph Cuts and Deep Neural Networks

Most data mining applications on collections of historical documents require binarization of the digitized images as a pre-processing step. Historical documents are often subjected to degradations such as parchment aging, smudges and bleed through from the other side. The text is sometimes printed, but more often handwritten. Mathematical modeling of the appearance of the text, as well as the b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Artificial Intelligence and Law

سال: 2022

ISSN: ['0924-8463', '1572-8382']

DOI: https://doi.org/10.1007/s10506-022-09341-8